Towards Realistic Performance Bounds for Implicit CFD Codes
Authors
Abstract
Scientific computing applications often achieve only a small fraction of peak performance [7,17]. In this paper, we discuss two causes of performance problems, insufficient memory bandwidth and a suboptimal instruction mix, in the context of a complete, parallel, unstructured mesh implicit CFD code. These results show that the performance of our code and of similar implicit codes is limited by the memory bandwidth of RISC-based processor nodes to as little as 10% of peak performance for some critical computational kernels. Limits on the number of basic operations that can be performed in a single clock cycle also constrain the performance of “cache-friendly” parts of the code.
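To make the memory-bandwidth argument concrete, the following is a minimal sketch of a bandwidth-limited performance bound for a sparse matrix-vector product, a kernel typical of implicit solvers. It is not the model from the paper: the CSR storage layout, the function names, and every machine and matrix parameter below are illustrative assumptions, and the traffic estimate optimistically assumes each vector is moved through memory only once.

```python
def csr_matvec_traffic(n_rows, nnz, value_bytes=8, index_bytes=4):
    """Rough flop and byte counts for y = A @ x with A stored in CSR.

    Assumption: matrix values, column indices, and row pointers are each
    read once, and x and y each cross the memory bus exactly once.
    """
    flops = 2 * nnz  # one multiply and one add per stored nonzero
    matrix_bytes = nnz * (value_bytes + index_bytes) + (n_rows + 1) * index_bytes
    vector_bytes = 2 * n_rows * value_bytes  # read x, write y
    return flops, matrix_bytes + vector_bytes


def bandwidth_bound(flops, bytes_moved, bandwidth_bytes_per_s):
    """Flop rate attainable if memory traffic alone determines run time."""
    return flops / (bytes_moved / bandwidth_bytes_per_s)


# Hypothetical machine: 1 Gflop/s peak, 0.5 GB/s sustained memory bandwidth.
peak_flops = 1.0e9
bandwidth = 0.5e9

# Hypothetical matrix: 100,000 rows with 50 nonzeros per row.
flops, nbytes = csr_matvec_traffic(n_rows=100_000, nnz=5_000_000)

# The kernel can run no faster than the smaller of the two limits.
bound = min(peak_flops, bandwidth_bound(flops, nbytes, bandwidth))
fraction_of_peak = bound / peak_flops

print(f"bound: {bound / 1e6:.1f} Mflop/s ({fraction_of_peak:.1%} of peak)")
```

With these made-up numbers the bound lands below 10% of peak, the same order as the figure quoted in the abstract. A real estimate would use measured sustained bandwidth and the code's actual data structures.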
Similar resources
Automatic Differentiation of Advanced CFD Codes for Multidisciplinary Design
Automated multidisciplinary design of aircraft and other flight vehicles requires the optimization of complex performance objectives with respect to a number of design parameters and constraints. The effect of these independent design variables on the system performance criteria can be quantified in terms of sensitivity derivatives which must be calculated and propagated by the individual disciplin...
On the Performance of Cooperative Diversity Protocols in Practical Wireless Systems
The concepts of cooperative diversity promise to offer the benefits of spatial diversity gains to handheld wireless devices with single antennas. The information-theoretic bounds that have been established recently serve as basic guidelines; yet, the performance of such protocols should additionally be examined for more realistic assumptions. Towards this end, we study cooperative diversity pro...
Advanced Optimizations of An Implicit Navier-Stokes Solver on GPGPU
General-purpose computing on graphics processing units (GPGPU) is a massive fine-grain parallel computation platform, which is particularly attractive for CFD tasks due to its potential of one or two orders of magnitude of performance improvement with relatively low capital investment. Many successful attempts have been reported in recent years (see, for example [1, 2, 3, 4, 5, 6]). Although early at...
The Next Four Orders of Magnitude in Performance for Parallel CFD
While some simulations whose computational work requirements are superlinear in memory requirements have executed at 1 Teraflop/s, simulations of PDE-based systems remain “mired” in the hundreds of Gigaflop/s on the same machines. We briefly review the algorithmic structure of typical PDE-based CFD codes that is responsible for this situation and consider possible architectural and algorithmic ...
High Performance 3D CFD Codes for Complex Piston Engine Applications
Economic (fuel price), political (energy independence), health (carcinogenic pollutants) and environmental (greenhouse effect) concerns have greatly contributed to the evolution of pollutant emission regulations and fuel consumption regulations in the last decade. A better understanding of the combustion process for operating conditions near the stability limit will probably be the next step t...